Automated Composition of Picture-Synched Music Soundtracks for Movies
We describe the implementation of and early results from a system that
automatically composes picture-synched musical soundtracks for videos and
movies. We use the phrase "picture-synched" to mean that the structure of the
automatically composed music is determined by visual events in the input movie,
i.e. the final music is synchronised to visual events and features such as cut
transitions or within-shot key-frame events. Our system combines automated
video analysis and computer-generated music-composition techniques to create
unique soundtracks in response to the video input, and can be thought of as an
initial step in creating a computerised replacement for a human composer
writing music to fit the picture-locked edit of a movie. Working only from the
video information in the movie, key features are extracted from the input
video, using video analysis techniques, which are then fed into a
machine-learning-based music generation tool, to compose a piece of music from
scratch. The resulting soundtrack is tied to video features, such as scene
transition markers and scene-level energy values, and is unique to the input
video. Although the system we describe here is only a preliminary
proof-of-concept, user evaluations of the output of the system have been
positive.
Comment: To be presented at the 16th ACM SIGGRAPH European Conference on
Visual Media Production. London, England: 17th-18th December 2019. 10 pages,
9 figures.
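The abstract above describes extracting visual events such as cut transitions from the input movie. The paper does not give implementation details, but a minimal stand-in for that analysis step could flag cuts by comparing grey-level histograms of consecutive frames; the bin count, distance metric, and threshold below are all assumptions for illustration:

```python
import numpy as np

def detect_cuts(frames, threshold=0.5):
    """Flag likely cut transitions by comparing grey-level histograms of
    consecutive frames. This is a simple sketch, not the paper's method;
    the metric and threshold are assumptions."""
    cuts = []
    prev_hist = None
    for i, frame in enumerate(frames):
        hist, _ = np.histogram(frame, bins=32, range=(0.0, 1.0))
        hist = hist / hist.sum()  # normalise counts to a probability distribution
        if prev_hist is not None:
            # L1 distance between normalised histograms lies in [0, 2]
            if np.abs(hist - prev_hist).sum() > threshold:
                cuts.append(i)
        prev_hist = hist
    return cuts

# Two synthetic "shots": dark frames followed by bright frames
frames = [np.full((4, 4), 0.1)] * 3 + [np.full((4, 4), 0.9)] * 3
print(detect_cuts(frames))  # [3]
```

In a real pipeline, the detected cut indices would become the synchronisation points handed to the music-generation stage.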
Dense Feature Aggregation and Pruning for RGBT Tracking
How to perform effective information fusion of different modalities is a core
factor in boosting the performance of RGBT tracking. This paper presents a
novel deep fusion algorithm based on the representations from an end-to-end
trained convolutional neural network. To deploy the complementarity of features
of all layers, we propose a recursive strategy to densely aggregate these
features that yield robust representations of target objects in each modality.
In different modalities, we propose to prune the densely aggregated features of
all modalities in a collaborative way. Specifically, we employ the operations
of global average pooling and weighted random selection to perform channel
scoring and selection, which removes redundant and noisy features to achieve a
more robust feature representation. Experimental results on two RGBT tracking
benchmark datasets suggest that our tracker achieves clear state-of-the-art
performance against other RGB and RGBT tracking methods.
Comment: arXiv admin note: text overlap with arXiv:1811.0985
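The channel scoring and selection described above (global average pooling followed by weighted random selection) can be sketched as follows. This is an illustrative toy version, not the paper's implementation; the keep ratio and the use of GAP magnitudes as sampling weights are assumptions:

```python
import numpy as np

def prune_channels(features, keep_ratio=0.5, seed=None):
    """Score channels of a (C, H, W) feature map by global average pooling,
    then keep a subset via weighted random selection without replacement.
    A sketch of the pruning idea; details are assumptions."""
    rng = np.random.default_rng(seed)
    c, h, w = features.shape
    scores = features.reshape(c, -1).mean(axis=1)   # global average pooling per channel
    weights = np.abs(scores)
    probs = weights / weights.sum()                 # scores -> sampling probabilities
    k = max(1, int(c * keep_ratio))
    keep = rng.choice(c, size=k, replace=False, p=probs)  # weighted random selection
    return features[np.sort(keep)]

feats = np.random.rand(64, 7, 7)
pruned = prune_channels(feats, keep_ratio=0.25)
print(pruned.shape)  # (16, 7, 7)
```

The stochastic selection, as opposed to simply taking the top-k channels, keeps some diversity in the retained features across modalities.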
Matching images and text with multi-modal tensor fusion and re-ranking
A major challenge in matching images and text is that they have intrinsically different data distributions and feature representations. Most existing approaches are based either on embedding or classification: the first maps image and text instances into a common embedding space for distance measuring, while the second regards image-text matching as a binary classification problem. Neither of these approaches can, however, balance matching accuracy and model complexity well. We propose a novel framework that achieves remarkable matching performance with acceptable model complexity. Specifically, in the training stage, we propose a novel Multi-modal Tensor Fusion Network (MTFN) to explicitly learn an accurate image-text similarity function with rank-based tensor fusion rather than seeking a common embedding space for each image-text instance. Then, during testing, we deploy a generic Cross-modal Re-ranking (RR) scheme for refinement without requiring an additional training procedure. Extensive experiments on two datasets demonstrate that our MTFN-RR consistently achieves state-of-the-art matching performance with much lower time complexity.
Accepted author manuscript.
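The core idea above, scoring an image-text pair with a rank-constrained fusion of the two feature vectors instead of measuring distance in a shared embedding space, can be illustrated with a toy low-rank bilinear form. The dimensions, the random factors, and the specific factorisation below are assumptions; MTFN's actual fusion operates on learned CNN and text features:

```python
import numpy as np

rng = np.random.default_rng(0)

d_img, d_txt, rank = 8, 6, 4  # toy sizes; real features are far larger

# Low-rank factors standing in for a full fusion tensor (rank-based tensor fusion)
U = rng.standard_normal((d_img, rank))
V = rng.standard_normal((d_txt, rank))
w = rng.standard_normal(rank)

def similarity(img_vec, txt_vec):
    """Score an image-text pair with a rank-constrained bilinear form:
    s = w . ((U^T img) * (V^T txt)), where * is elementwise product.
    The pair is scored directly; no shared embedding space is built."""
    return float(w @ ((U.T @ img_vec) * (V.T @ txt_vec)))

img = rng.standard_normal(d_img)
txt = rng.standard_normal(d_txt)
print(similarity(img, txt))
```

In MTFN the factors would be learned with a ranking objective so that matching pairs score higher than mismatched ones; the re-ranking stage then refines the retrieval lists produced by these scores.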
The Thermal Infrared Visual Object Tracking VOT-TIR2016 Challenge Results
The Thermal Infrared Visual Object Tracking challenge 2016, VOT-TIR2016, aims at comparing short-term single-object visual trackers that work on thermal infrared (TIR) sequences and do not apply pre-learned models of object appearance. VOT-TIR2016 is the second benchmark on short-term tracking in TIR sequences. Results of 24 trackers are presented. For each participating tracker, a short description is provided in the appendix. The VOT-TIR2016 challenge is similar to the 2015 challenge; the main difference is the introduction of new, more difficult sequences into the dataset. Furthermore, the VOT-TIR2016 evaluation adopted the improvements regarding overlap calculation in VOT2016. Compared to VOT-TIR2015, a significant general improvement of results has been observed, which partly compensates for the more difficult sequences. The dataset, the evaluation kit, as well as the results are publicly available at the challenge website.
The Visual Object Tracking Vot2016 Challenge Results
The Visual Object Tracking challenge VOT2016 aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 70 trackers are presented, with a large number of trackers having been published at major computer vision conferences and journals in recent years. The number of tested state-of-the-art trackers makes VOT2016 the largest and most challenging benchmark on short-term tracking to date. For each participating tracker, a short description is provided in the appendix. VOT2016 goes beyond its predecessors by (i) introducing a new semi-automatic ground-truth bounding-box annotation methodology and (ii) extending the evaluation system with the no-reset experiment.
The Visual Object Tracking VOT2015 challenge results
The Visual Object Tracking challenge 2015, VOT2015, aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 62 trackers are presented. The number of tested trackers makes VOT2015 the largest benchmark on short-term tracking to date. For each participating tracker, a short description is provided in the appendix. Features of the VOT2015 challenge that go beyond its VOT2014 predecessor are: (i) a new VOT2015 dataset twice as large as that of VOT2014, with full annotation of targets by rotated bounding boxes and per-frame attributes, and (ii) extensions of the VOT2014 evaluation methodology through the introduction of a new performance measure. The dataset, the evaluation kit, as well as the results are publicly available at the challenge website.